(54) Method and system for providing alternatives for text derived from stochastic input sources



(57) A computer-implemented method for providing 
a candidate list of alternatives for a text selection con- 
taining text from multiple input sources, each of which 
can be stochastic (such as a speech recognition unit, 
handwriting recognition unit, or input method editor) or 
non-stochastic (such as a keyboard and mouse). A text 
component of the text selection may be the result of 
data processed through a series of stochastic input 
sources, such as speech input that is converted to text 
by a speech recognition unit before being used as input 
into an input method editor. To determine alternatives 
for the text selection, a stochastic input combiner parses
the text selection into text components from different
input sources. For each stochastic text component, the
combiner retrieves a stochastic model containing alternatives
for the text component. If the stochastic text
component is the result of a series of stochastic input 
sources, the combiner derives a stochastic model that 
accurately reflects the probabilities of the results of the 
entire series. The combiner creates a list of alternatives 
for the text selection by combining the stochastic mod- 
els retrieved. The combiner may revise the list of alter- 
natives by applying natural language principles to the 
text selection as a whole. The list of alternatives for the 
text selection is then presented to the user. If the user 
chooses one of the alternatives, then the word processor
replaces the text selection with the chosen candidate.

Description

FIELD OF THE INVENTION 

[0001] The invention relates generally to methods
for entering text into a computer and, more particularly,
relates to providing alternatives for a text selection
derived from multiple stochastic input sources.

BACKGROUND OF THE INVENTION 

[0002] Computer users have traditionally entered 
text into word processors through a keyboard and 
mouse. In recent years, however, word processors have
become more sophisticated by allowing users to enter 
text into them through other input methods, such as 
speech or handwriting. Although a computer cannot 
always interpret such input with complete accuracy, a 
computer can generate a list of text alternatives for the 
input. Furthermore, the computer can often assign to 
each alternative a probability that the alternative is the 
one the user intended. Input that produces such probabilistic
results is called "stochastic input," while input
that can be accurately determined, such as typed text, 
is called "non-stochastic input." 
[0003] Typically, text produced in a word processor 
from stochastic input must be heavily edited by the user 
in order to produce the text intended by the user when
he or she created the stochastic input. The editing proc- 
ess has been simplified by allowing the user to select 
text created from stochastic data and request alterna- 
tives for the text selection. In response, the computer 
can provide the user with alternatives for the text selection
through a graphical user interface. If the user
chooses one of the alternatives, the computer replaces 
the text selection with the selected alternative. 
[0004] Suppose that after a user creates text in a 
word processor by providing the word processor with 
stochastic input, such as speech, the user then edits the
text. The user may, for example, replace a word of the
text with a new word typed into the computer with a key- 
board. Current word processors do not incorporate 
typed text edits into the alternatives they provide for an 
edited text selection. Thus, there is a need in the art for
a method of providing alternatives to edited text derived 
from stochastic input. 

[0005] Another problem occurs if the user attempts
to request alternatives for a text selection spanning mul- 
tiple stochastic input sources. For instance, the user 
may request alternatives for a text selection containing 
a word based on handwriting input and a word based on 
speech input. Current word processors are not capable 
of providing meaningful alternatives for such a text 
selection. Thus, there is also a need in the art for a 
method of providing alternatives for a text selection 
derived from multiple stochastic input sources. 
[0006] An input method editor (IME) is another word
processor input method that produces stochastic data.
Generally, an IME converts input into foreign language
text. The input into an IME may, for example, be typed
text entered into the computer through a keyboard and
mouse. An IME is especially useful for creating ideograms
in Asian and other languages. Because there are
many more ideograms in such languages than there are
keys on a keyboard, entering a particular ideogram into
the computer typically requires multiple keystrokes,
which the IME interprets as a composed character.

[0007] In a typical IME, a user may type in English
characters defining a phonetic spelling for a desired
Chinese character. Since many Chinese characters
have similar pronunciations, the typed phonetic spelling
may represent any one of a number of different Chinese
characters. The IME then provides the user with the
most probable candidates corresponding to the typed
phonetic spelling so that the user can choose the correct
one.

[0008] Programmers have previously recognized
the value of providing speech input into an IME. This is
done by first converting the speech into text, which is
then used as input into the IME. As has already been
explained, however, the interpretation of speech is stochastic
in nature. Hence, the text produced by the
speech interpreter may not be the text that was
intended by the user. If incorrect text is used as input
into the IME, the results produced by the IME are likely
to be poor. Accordingly, when speech is used as input
into an IME, the program interpreting the speech data
typically allows the user to first correct the text produced
by the speech interpreter before inputting that text into
the IME. When the IME produces foreign language
translations of the text, the user may again choose the
desired alternative because the result of an IME is also
stochastic in nature. Requiring the user to edit the
results at two different stages of the process can be
inefficient and inconvenient. Thus, there is a further
need in the art for an improved method of handling
speech input to an IME.

SUMMARY OF THE INVENTION 

[0009] The present invention meets the needs
described above in a stochastic input combiner that
facilitates the editing of text. The invention does this by
providing alternatives for a text selection made by the
user, even where that text selection is derived from multiple
input sources, one or more of which can be stochastic
in nature.

[0010] The stochastic input combiner provides the
alternatives to the user in list form through a graphical
user interface. The user can then choose one of the
alternatives to replace the text selection the user has
highlighted for editing. This can often be quicker than
requiring the user to think of alternatives on his or her
own and then make changes accordingly using a keyboard
and mouse. If the user does not find an alternative
the user likes for the text selection, the user can edit
the text selection using the keyboard and mouse. In
response, the stochastic input combiner can provide an
updated list of alternatives that incorporate the user's
changes. Often, the user need only partially edit the text
selection before the stochastic input combiner produces
an alternative the user likes, so the stochastic input
combiner again improves editing efficiency.
[0011] The stochastic input combiner may also provide
the advantage of a natural language model. Such a
model may analyze the text selection as a whole using 
natural language principles in order to provide a better 
list of alternatives for the text selection. This, too, 
improves the user's editing efficiency. 
[0012] Generally described, the present invention
includes a method for correcting text. The user first
enters text into the computer, perhaps using multiple
stochastic input sources. The user may also use a keyboard
and mouse to enter text into the computer.
[0013] Keyboard/mouse entry of text is an example
of an input source which is non-stochastic, meaning that
the text intended to be entered through the source can
be determined with complete accuracy. On the other
hand, a stochastic input source is one that converts
input into a list of alternatives, each having less than
100% probability of being the correct alternative.
Because speech cannot always be interpreted by a
computer with complete accuracy, a speech recognition
unit is an example of a stochastic input source which
converts speech input into a list of text alternatives.
Other examples of a stochastic input source include a
handwriting recognition unit and an input method editor
(IME). Where an input source for a text component is 
stochastic, the most likely alternative for the text compo- 
nent is generally used to represent the text component 
in the text selection. 

[0014] Once the user enters text into the computer, 
the user can begin the correction process by making a 
text selection of a portion of the text the user entered. 
This text selection can include multiple text compo- 
nents. Such a text component can be a subset of the 
text selection that the user entered through a single 
input source. The user may have entered different text
components within the text selection using different 
input sources, and the user may have entered one or 
more of the text components with a stochastic input 
source. 

[0015] Once the user makes a text selection, the 
user may enter a command to display alternatives for 
the text selection as a whole. The stochastic input combiner
then parses the text selection into its text components
and retrieves a stochastic model representing the
alternatives for a text component originating from a sto- 
chastic input source. This stochastic model may include 
a list of the alternatives for the text component together 
with the probabilities associated with the alternatives. 
Alternatively, the stochastic model may include a lattice. 
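The list form of a stochastic model described above can be pictured with a small data structure. The following is only an illustrative sketch: the class names `Alternative` and `StochasticModel`, the `source` label, and the probabilities are assumptions introduced for this example and do not appear in the disclosure.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Alternative:
    text: str
    probability: float  # likelihood that this is the text the user intended


@dataclass
class StochasticModel:
    source: str                      # hypothetical label, e.g. "speech"
    alternatives: List[Alternative]  # the list of alternatives with probabilities

    def best(self) -> Alternative:
        # The most likely alternative is the one used to represent
        # the text component in the text selection.
        return max(self.alternatives, key=lambda a: a.probability)


# Made-up probabilities for a single dictated text component.
model = StochasticModel(
    source="speech",
    alternatives=[
        Alternative("wreck a nice", 0.5),
        Alternative("recognize", 0.4),
        Alternative("wreck an ice", 0.1),
    ],
)
shown_text = model.best().text
```

A lattice-based model would store the same information more compactly as a graph of word hypotheses rather than as a flat list.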
[0016] The stochastic input combiner then combines
the stochastic model retrieved with other text
components to produce a list of alternatives for the text
selection as a whole. The stochastic input combiner
then displays this list of alternatives for the text selection
on a display device, such as a monitor. The user may
then select one of the displayed alternatives. In that
case, the selected alternative replaces the text selection.
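One possible way to combine per-component models into alternatives for the whole selection is sketched below. It assumes, purely for illustration, that the components are statistically independent so their probabilities can be multiplied, and that a non-stochastic component is represented as a single alternative with probability 1.0. The function name `combine_models` and the sample data are hypothetical.

```python
import heapq
from itertools import product
from typing import List, Tuple

NBest = List[Tuple[str, float]]  # (text, probability) pairs


def combine_models(components: List[NBest], limit: int = 5) -> NBest:
    """Build alternatives for the whole selection from per-component lists."""
    combined = []
    for choice in product(*components):
        text = " ".join(t for t, _ in choice)
        prob = 1.0
        for _, p in choice:
            prob *= p  # independence between components is an assumption
        combined.append((text, prob))
    # Keep only the most probable alternatives for display.
    return heapq.nlargest(limit, combined, key=lambda a: a[1])


handwriting = [("two", 0.7), ("too", 0.3)]      # stochastic component
typed = [("nice", 1.0)]                         # non-stochastic component
speech = [("beaches", 0.6), ("peaches", 0.4)]   # stochastic component

alternatives = combine_models([handwriting, typed, speech])
```

With these made-up numbers the first entry is "two nice beaches", and the typed word appears unchanged in every alternative.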

[0017] The stochastic input combiner may also utilize
a natural language model. In this alternative, the
stochastic input combiner may combine the stochastic
models for each stochastic text component to produce
an interim list of alternatives for the text selection that
the combiner provides to the natural language model.
The natural language model forms a revised list of alternatives
by reevaluating the interim list of alternatives
based on natural language principles applied by the natural
language model to the text selection as a whole.
The natural language model may also add new alternatives
to the revised list of alternatives that are not found
in the interim list. After the natural language model
returns the revised list of alternatives to the stochastic
input combiner, the stochastic input combiner provides
the revised list of alternatives for display.
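The reevaluation step can be sketched as a rescoring pass over the interim list. The disclosure does not say how the natural language model scores text, so the scoring function here is a toy stand-in: `toy_language_score`, `COMMON_PHRASES`, and all numbers are assumptions. The sketch also omits the model's ability to add new alternatives.

```python
from typing import Callable, List, Tuple

NBest = List[Tuple[str, float]]


def rescore(interim: NBest, language_score: Callable[[str], float]) -> NBest:
    """Reweight each interim alternative by a language score, then renormalize."""
    weighted = [(text, p * language_score(text)) for text, p in interim]
    total = sum(p for _, p in weighted)
    if total == 0:
        return interim  # nothing usable from the language model
    revised = [(text, p / total) for text, p in weighted]
    return sorted(revised, key=lambda a: a[1], reverse=True)


# Toy stand-in for a natural language model: known phrases score higher.
COMMON_PHRASES = {"recognize speech": 0.9}


def toy_language_score(text: str) -> float:
    return COMMON_PHRASES.get(text, 0.1)


interim = [("wreck a nice beach", 0.6), ("recognize speech", 0.4)]
revised = rescore(interim, toy_language_score)
```

Here the alternative that ranked second in the interim list moves to the top of the revised list because the toy language score favors it.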
[0018] In another alternative, a series of stochastic
input sources creates a stochastic text component. This
means that at least one stochastic input source produces
a stochastic result that serves as input into a second
stochastic input source. Typically, the first
stochastic input source of the series requires user input,
while subsequent stochastic input sources in the series
receive an alternative produced by the previous stochastic
input source as input. An example of this is a
speech recognition unit that produces text used as input
into an IME. When stochastic input sources in series
produce a stochastic text component, the stochastic
input combiner can produce a series stochastic model
that contains alternatives for the stochastic text component
and accurate probabilities for those alternatives
that account for the likelihood of the inputs into each stochastic
input source in the series. This process eliminates
any need for the user to choose a single
alternative produced by a stochastic input source to use
as input into a subsequent stochastic input source.
[0019] To produce a series stochastic model for a
text component derived from a series of stochastic input
sources, the stochastic input combiner first submits to
the first stochastic input source in the series the user
input intended for that stochastic input source. By
processing the user input, the first stochastic input
source produces a stochastic result. The stochastic
result that the first stochastic input source produces has
multiple alternatives, and the stochastic input combiner
selects that stochastic result. By using each alternative
of the selected stochastic result as input into the second
stochastic input source to produce a stochastic result
for the second stochastic input source, the stochastic
input combiner produces multiple stochastic results,
each stochastic result having multiple alternatives, for
the second stochastic input source. If any stochastic
result for the second stochastic input source does not
contain an "n-best" alternatives list, the stochastic input
combiner converts that stochastic result into an "n-best"
alternatives list because converting all stochastic
results into the same format simplifies the process of
combining them. The stochastic input combiner then
combines the stochastic results for the second stochastic
input source to create a totalized alternatives list for
the second stochastic input source. If there are only two
stochastic input sources in the series, then the totalized
alternatives list may serve as the stochastic model for
the text component resulting from the series.
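The conversion of a stochastic result into an "n-best" alternatives list can be illustrated for the case where the result is a lattice. The lattice encoding below (a dictionary mapping each node to outgoing arcs of next node, word, and probability) is an assumption made for this sketch, as are the function name and the sample values. The sketch also assumes the lattice is acyclic and small enough to enumerate every path.

```python
from typing import Dict, List, Tuple

# node -> list of (next_node, word, arc_probability); hypothetical encoding
Lattice = Dict[str, List[Tuple[str, str, float]]]


def lattice_to_nbest(lattice: Lattice, start: str, end: str,
                     limit: int = 5) -> List[Tuple[str, float]]:
    """Enumerate every start-to-end path and keep the most probable ones."""
    results = []
    stack = [(start, [], 1.0)]
    while stack:
        node, words, prob = stack.pop()
        if node == end:
            results.append((" ".join(words), prob))
            continue
        for next_node, word, p in lattice.get(node, []):
            stack.append((next_node, words + [word], prob * p))
    results.sort(key=lambda a: a[1], reverse=True)
    return results[:limit]


lattice = {
    "s": [("a", "wreck", 0.6), ("e", "recognize", 0.4)],
    "a": [("b", "a", 1.0)],
    "b": [("e", "nice", 0.7), ("e", "niece", 0.3)],
}
nbest = lattice_to_nbest(lattice, "s", "e")
```

Once every stochastic result is in this list form, results can be merged by adding the probabilities of identical alternatives.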
[0020] The stochastic input combiner may also be
functional for expanding the scope of correction for a 
text selection received from a user to a larger text unit. 
To do this, the stochastic input combiner submits the 
text selection to a correction scope model to make the 
determination of whether the scope of correction should 
be adjusted. In response to submitting the text selec- 
tion, the stochastic input combiner receives from the 
correction scope model a text unit that includes the text 
selection and at least one adjacent word. Using the text 
unit, the stochastic input combiner can then produce a 
list of alternatives for the text unit and display those 
alternatives on a display device. 
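A correction scope model could take many forms. As a minimal sketch, the example below assumes a toy model that holds a table of known multi-word misrecognitions and widens the selection when it falls inside one of them. The function `expand_scope`, its index convention (end exclusive), and the error table are all hypothetical.

```python
from typing import List, Tuple


def expand_scope(words: List[str], start: int, end: int,
                 known_errors: List[List[str]]) -> Tuple[int, int]:
    """Return (start, end) word indexes of the text unit, end exclusive."""
    for phrase in known_errors:
        n = len(phrase)
        for i in range(len(words) - n + 1):
            covers_selection = i <= start and end <= i + n
            if words[i:i + n] == phrase and covers_selection:
                return i, i + n  # larger text unit that contains the selection
    return start, end  # no adjustment needed


words = "I can wreck a nice voice".split()
# The user selected only the word "wreck" (index 2).
unit = expand_scope(words, 2, 3, [["wreck", "a", "nice"]])
text_unit = words[unit[0]:unit[1]]
```

The returned text unit includes the selected word and the adjacent words, so alternatives can be produced for the whole phrase.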
[0021] The various aspects of the present invention
may be more clearly understood and appreciated from a 
review of the following detailed description of the dis- 
closed embodiments and by reference to the appended 
drawings and claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0022] 

FIG. 1 is a block diagram illustrating the operating
environment for an exemplary embodiment of the
present invention.

FIG. 2 is a block diagram providing an overview of
the program modules of a multi-source data
processing system.

FIG. 3 is a block diagram that illustrates the operation
of a typical embodiment of the present invention.

FIGS. 4A-4B are block diagrams that illustrate the
operation of an embodiment of the present invention
that allows the user to edit a text selection.

FIG. 5 is a block diagram that illustrates another
embodiment of the present invention in which a natural
language model is operative.

FIG. 6 is a flow chart of the steps in a routine for
processing source data from multiple input sources.

FIG. 7 is a flow chart of the steps in a routine for
determining alternatives for a multi-source text
selection.

FIG. 8 is a flow chart of the steps in a routine for
retrieving stochastic models for the text components
in a text selection.

FIG. 9 is a flow chart of the steps in a routine for
deriving a series stochastic model.

FIG. 10 is a flow chart of the steps in an alternative
routine for processing source data that includes
changing the scope of correction of a text selection.

DETAILED DESCRIPTION OF THE EXEMPLARY
EMBODIMENTS

[0023] The present invention is typically embodied
in a word processor that can receive input from multiple
sources, each of which can be a non-stochastic input
source or a stochastic input source. Keyboard/mouse
entry of text is an example of an input source which is
non-stochastic, meaning that the computer can determine
the text the user intended with complete accuracy.
On the other hand, a stochastic input source is one that
converts input into a stochastic result. A stochastic
result is one having multiple alternatives, each having
less than 100% probability of being the correct alternative.
An example of a stochastic input source is a
speech recognition unit, which converts speech input
into a list of text alternatives since a computer cannot
always interpret speech with complete accuracy. Other
examples of a stochastic input source are a handwriting
recognition unit and an input method editor (IME).
[0024] The word processor is functional for allowing
the user to select a section of text and to request alternatives
for that selection. If the computer has created
the text selection from one or more stochastic input
sources, there will be alternatives for the text selection.
After the word processor determines alternatives for
the text selection, the word processor displays the alternatives
through a graphical user interface. If the user
chooses one of the alternatives for the text selection,
then the word processor replaces the text selection with
the chosen candidate.

[0025] After examining the list of alternatives that
the word processor provides, the user may not find an
acceptable alternative for the text selection. Hence, the
word processor may allow the user to edit the text selection
using a keyboard and mouse. The user may, for
example, change one of the words in the text selection.
In that case, the word processor may then revise the list
of alternatives to incorporate the edit and provide the
revised list of alternatives to the user. If the user
chooses one of the revised alternatives for the text
selection, then the word processor replaces the text
selection with the chosen alternative.

[0026] A program module called a stochastic input
combiner is typically responsible for producing the alternatives
for a text selection. The stochastic input combiner
does this by parsing the text selection into smaller
text components derived from no more than one stochastic
input source. For each stochastic text component,
the stochastic input combiner then retrieves a
stochastic model representing the alternatives for the
text component. Then, the stochastic input combiner
can combine the stochastic models retrieved with other
text components to produce a list of alternatives for the
text selection as a whole.
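The parsing step can be sketched as grouping consecutive words by the input source that produced them. This assumes, for illustration only, that the word processor records a source label for every word; the function name `parse_components` and the labels are hypothetical.

```python
from itertools import groupby
from typing import List, Tuple


def parse_components(tagged_words: List[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Split a selection of (word, source) pairs into (source, text) components."""
    components = []
    # groupby merges only adjacent words, so each component comes
    # from a single input source and keeps its position in the text.
    for source, group in groupby(tagged_words, key=lambda w: w[1]):
        components.append((source, " ".join(word for word, _ in group)))
    return components


selection = [
    ("please", "keyboard"),
    ("wreck", "speech"),
    ("a", "speech"),
    ("nice", "speech"),
    ("beach", "handwriting"),
]
components = parse_components(selection)
```

Each stochastic component found this way would then be looked up to retrieve its stochastic model, while keyboard components are carried through unchanged.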

[0027] The stochastic input combiner can be part of
the word processing application. Alternatively, the stochastic
input combiner can be a separate utility that is
part of the operating system. The combiner could also
be a separate program that interfaces with the word
processor but that is not part of the operating system.
[0028] To improve the list of alternatives the word
processor offers for a text selection, the word processor
may use a natural language model. The natural language
model may apply natural language principles to
the text selection as a whole to reevaluate the likelihood
of the alternatives in the alternatives list produced by
the stochastic input combiner and to add new alternatives
to the alternatives list. Alternatively, the natural language
model may apply natural language principles to
individual text components to improve the stochastic
models that the stochastic input combiner uses to produce
the alternatives list.

[0029] A text component is occasionally derived
from a series of stochastic input sources. This means
that at least one stochastic input source produces a stochastic
result that serves as input into a second stochastic
input source. The first stochastic input source in
the series typically receives user input, while the last
stochastic input source in the series produces the alternatives
for the text component. The stochastic input
combiner can derive a series stochastic model containing
alternatives for such a text component without
requiring the user to select a single alternative result for
each stochastic input source in the series as input for
the subsequent stochastic input source.
[0030] To derive a series stochastic model for a text
component, the stochastic input combiner first selects
the user input that ultimately produced the text component.
The stochastic input combiner then submits the
selected user input to the first stochastic input source in
series order. The stochastic input combiner then submits
each alternative produced by the first stochastic
input source as input into the subsequent stochastic
input source in series order. Because the subsequent
stochastic input source produces a stochastic result
from each alternative submitted to it, the stochastic
results of the subsequent stochastic input source must
be combined into a totalized candidate list. If there is yet
another stochastic input source in the series, the stochastic
input combiner submits each candidate of the
totalized candidate list as input into that next stochastic
input source. The totalized candidate list that the last
stochastic input source of the series produces in this
fashion is the series stochastic model.
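The derivation above can be sketched for a series of any length. The example treats each stochastic input source as a function that returns a list of (text, probability) pairs, and it totalizes by multiplying the probability of each input by the probability of each output and summing over identical outputs. That weighting rule, the function names, and the toy sources are assumptions made for illustration; the labels "G1" to "G3" stand in for ideograms.

```python
from collections import defaultdict
from typing import Callable, Dict, List, Tuple

NBest = List[Tuple[str, float]]
Source = Callable[[str], NBest]  # a stochastic input source


def series_model(user_input: str, sources: List[Source]) -> NBest:
    """Chain sources in series order, totalizing after every stage."""
    current: NBest = [(user_input, 1.0)]
    for source in sources:
        totals: Dict[str, float] = defaultdict(float)
        for candidate, p_candidate in current:
            # Submit every candidate, not only the single most likely one.
            for text, p_text in source(candidate):
                totals[text] += p_candidate * p_text
        current = sorted(totals.items(), key=lambda a: a[1], reverse=True)
    return current


def toy_speech(audio: str) -> NBest:
    # Hypothetical recognizer output for one utterance.
    return [("ma", 0.7), ("na", 0.3)]


def toy_ime(spelling: str) -> NBest:
    # Hypothetical IME table from phonetic spelling to ideogram labels.
    table = {
        "ma": [("G1", 0.6), ("G2", 0.4)],
        "na": [("G2", 0.5), ("G3", 0.5)],
    }
    return table.get(spelling, [])


model = series_model("audio-clip", [toy_speech, toy_ime])
```

With these made-up numbers "G2" ranks first in the series model, even though it is not the IME's top candidate for the most likely spelling, which shows why carrying every alternative through the series can matter.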
[0031] Often, stochastic input sources make an
error which spans multiple words. During the correction
process, a user may not notice the full extent of that
error. For example, if the user dictates the word "recognize,"
the speech recognition engine might conclude
that the most likely interpretation of the speech input is
"wreck a nice." While editing, the user might see the
word "wreck" and request alternatives only for that word
because the user did not notice that the following words
were also incorrect (i.e., "a nice").
[0032] If a user makes a text selection that does not 
include adjacent words that are incorrect because of a 
related error and if the word processor only uses the 
user's text selection to produce an alternatives list, none 
of the alternatives offered for the text selection may be 
the text the user intended at the time of input. Similarly,
replacing the text selection with an alternative chosen 
by the user from the alternatives list would leave the 
incorrect adjacent words in the text. 
[0033] To eliminate these disadvantages, the stochastic
input combiner may submit the text selection to
a correction scope model which determines if the scope
of correction should be expanded. In the "recognize"
example, an appropriate text unit for correction would
be "wreck a nice." To make this determination, the correction
scope model may draw on information included
in a natural language model, models of likely errors, and
models tied to the input methods used to produce the
text in the word processor. The models associated with
the input methods may include acoustic models for
speech recognition, handwriting models for handwritten
input, and vision-based models for recognizing sign language
or other gestures.

[0034] If the correction scope model determines
that the scope of correction should be adjusted, the correction
scope model identifies one or more larger text
units for which the stochastic input combiner should 
produce alternatives in the manner already described. 
The correction scope model sends a list of these text 
units to the stochastic input combiner for processing. 
[0035] Turning now to the figures, in which like 
numerals refer to like elements throughout the several 
figures, aspects of the present invention will be 
described. 

Exemplary Operating Environment 

[0036] FIG. 1 and the following discussion are 
intended to provide a brief and general description of a 
suitable computing environment 100 for an implementa- 
tion of the present invention. The exemplary operating 
environment 100 includes a conventional personal com- 
puter system 120, including a processing unit 121, a 
system memory 122, and a system bus 123 that cou- 
ples the system memory 122 to the processing unit 121. 
The system memory 122 includes read only memory 
(ROM) 124 and random access memory (RAM) 125. A
basic input/output system 126 (BIOS), containing the
basic routines that help to transfer information between
elements within the personal computer system 120,
such as during start-up, is stored in ROM 124.
[0037] The personal computer system 120 further
includes a hard disk drive 127, a magnetic disk drive
128, e.g., to read from or write to a removable magnetic 
disk 129, and an optical disk drive 130, e.g., for reading 
a CD-ROM disk 131 or to read from or write to other 
optical media. The hard disk drive 127, magnetic disk 
drive 128, and optical disk drive 130 are connected to 
the system bus 123 by a hard disk drive interface 132, a 
magnetic disk drive interface 133, and an optical drive 
interface 134, respectively. The drives and their associ- 
ated computer-readable media provide nonvolatile stor- 
age for the personal computer system 120. Although 
the description of computer-readable media above 
refers to a hard disk, a removable magnetic disk and a
CD-ROM disk, it should be appreciated by those skilled
in the art that other types of media that are readable by
a computer system, such as magnetic cassettes, flash
memory cards, digital video disks, Bernoulli cartridges,
and the like, may also be used in the exemplary operating
environment.

[0038] A user may enter commands and informa- 
tion into the personal computer 120 through conven- 
tional input devices, including a keyboard 140 and 
pointing device, such as a mouse 142. A microphone 
161 may be used to enter audio input, such as speech, 
into the computer system 120. A user may enter graph- 
ical information, such as drawings or handwriting, into 
the computer system by drawing the graphical information
on a writing tablet 162 using a stylus. The computer
system 120 may include additional input devices (not 
shown), such as a joystick, game pad, satellite dish, 
scanner, touch screen/stylus, or the like. The micro- 
phone 161 can be connected to the processing unit 121 
through an audio adapter 160 that is coupled to the sys- 
tem bus. The other input devices are often connected to 
the processing unit 121 through a serial port interface 
146 that is coupled to the system bus, but may be con- 
nected by other interfaces, such as a game port or a 
universal serial bus (USB). 

[0039] A monitor 147 or other type of display device
is also connected to the system bus 123 via an inter- 
face, such as a video adapter 148. In addition to the 
monitor, personal computer systems typically include
other peripheral output devices (not shown), such as 
speakers or printers. 

[0040] The personal computer system 120 may 
operate in a networked environment using logical con- 
nections to one or more remote computer systems, 
such as a remote computer system 149. The remote 
computer system 149 may be a server, a router, a peer 
device or other common network node, and typically 
includes many or all of the elements described relative 
to the personal computer system 120, although only a 
memory storage device 150 has been illustrated in FIG.
1. The logical connections depicted in FIG. 1 include a
local area network (LAN) 151 and a wide area network 
(WAN) 152. Such networking environments are com- 
monplace in offices, enterprise-wide computer net- 
works, intranets and the Internet. 



[0041] When used in a LAN networking environment,
the personal computer system 120 is connected
to the LAN 151 through a network interface 153. When
used in a WAN networking environment, the personal
computer system 120 typically includes a modem 154
or other means for establishing communications over a
WAN 152, such as the Internet. The modem 154, which
may be internal or external, is connected to the system
bus 123 via the serial port interface 146. In a networked
environment, program modules depicted relative to the
personal computer system 120, or portions thereof, may
be stored in the remote memory storage device 150. It
will be appreciated that the network connections shown
are exemplary and other means of establishing a communications
link between the computer systems may be
used. It will be further appreciated that the invention
could equivalently be implemented on host or server
computer systems other than personal computer systems,
and could equivalently be transmitted to the host
computer system by means other than a CD-ROM, for
example, by way of the network connection interface
153.

[0042] A number of program modules may be
stored in the drives and RAM 125 of the computer system
120. Program modules control how the computer
system 120 functions and interacts with the user, with
I/O devices or with other computers. Program modules
include routines, operating system 135, application
program modules 138, data structures, browsers, and
other software or firmware components. The invention
may conveniently be implemented in one or more program
modules, such as a stochastic input combiner program
module 137 and a stochastic input interface
program module 139, each of which is based upon the
methods described in the detailed description.

[0043] The application program modules 138 may
comprise a variety of applications used in conjunction
with the present invention, some of which are shown in
FIG. 2. The purposes of and interactions between some
of these program modules are discussed more fully in
the text describing FIG. 2. These include a word processor
program 210 (such as WORD, produced by Microsoft
Corporation of Redmond, WA), a handwriting
recognition program module 230, a speech recognition
program module 240, and an input method editor (IME)
250.

[0044] No particular programming language will be
described for carrying out the various procedures
described in the detailed description because it is considered
that the operations, steps, and procedures
described and illustrated in the accompanying drawings
are sufficiently disclosed to permit one of ordinary skill
in the art to practice an exemplary embodiment of the
present invention. Moreover, there are many computers
and operating systems which may be used in practicing
an exemplary embodiment, and therefore no detailed
computer program could be provided which would be
applicable to all of these many different systems. Each
user of a particular computer will be aware of the language
and tools which are most useful for that user's
needs and purposes.

[0045] Those skilled in the art will appreciate that
the invention may be practiced with other computer system
configurations, including hand-held devices, multiprocessor
systems, microprocessor-based or
programmable consumer electronics, minicomputers,
mainframe computers, and the like. The invention may
also be practiced in distributed computing environments
where tasks are performed by remote processing
devices that are linked through a communications network.
In a distributed computing environment, program
modules may be located in both local and remote memory
storage devices.

Overview of Program Modules 

[0046] FIG. 2 provides an overview of the program
modules of a multi-source data processing system 200.
Generally, the program modules shown in FIG. 2 enable
a user to enter text into an application 210, such as a
word processor, using both stochastic and non-stochastic
input sources. Typical stochastic input sources
include a handwriting recognition program module 230,
speech recognition program module 240, input method
editor (IME) 250, and speech recognition program module
260. A keyboard 140 is a typical source of non-stochastic
data. Once the user enters text into the word
processor 210 through one or more of these input
sources, the user may then select a section of text and
request a candidate list of alternatives for that text
selection. The text selection may contain input from
multiple stochastic and non-stochastic input sources.
As long as the text selection is derived from at least one
stochastic input source, there will be alternatives for the
text selection. The program modules can produce this
candidate list and present it to the user through a
graphical user interface. If the user chooses one of the
candidates, the text selection is replaced with the chosen
candidate. The operation of stochastic input
sources 230, 240, 250, and 260 is now discussed in
turn.

[0047] The handwriting recognition program module
230 receives handwriting input 280 from the user.
The user may generate the handwriting input 280 by
writing on the writing tablet 162 with a stylus. Alternatively,
the user may generate handwriting input 280
using other devices. For instance, the user may write on
the monitor 147 with a mouse 142, or the user may write
on a touch screen using a stylus. After input, the handwriting
input 280 is preferably directed to the handwriting
recognition program module 230 by a writing tablet
driver module in the operating system 135.
[0048] As handwriting is often difficult for a computer
to interpret, the handwriting recognition program
module 230 cannot always decipher the handwriting
input 280 with complete accuracy. The best the program
module 230 can do is to generate alternatives for the
handwriting input 280 and assign a probability that each
alternative is the correct one. By definition, then, the
handwriting recognition program module 230 generates
a stochastic result. The stochastic model 270a includes
a data structure containing the stochastic data produced
by processing handwriting input 280 with the
handwriting recognition program module 230.
[0049] Although any data structure capable of stor- 
ing stochastic data can comprise a stochastic model 
270, two useful structures for doing so are a lattice and 
an "n-best" alternatives list. A lattice is a structure that is 
well known to those skilled in the art, so a complete 
description will not be given. Briefly, however, a lattice 
stores words or phrases produced by a stochastic input 
source in nodes. Because each word or phrase is stochastic
data, the node also stores a probability assigned
to the associated word or phrase. Using methods well 
known to those skilled in the art, the lattice can be tra- 
versed in order to produce likely alternatives for any 
section of text represented by the stochastic data. Fur- 
thermore, lattices representing adjacent pieces of text 
can be combined into a metalattice through a process 
known as concatenation. The metalattice can then be 
traversed to produce alternatives for the adjacent 
pieces of text. 

[0050] Alternatively, stochastic data may be repre- 
sented by a list of the n-best alternatives and their asso- 
ciated probabilities. For any given word or phrase, an n- 
best alternatives list may be produced from a lattice rep- 
resenting the word or phrase. 
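
The two structures described above can be sketched in a few lines. This is only an illustration under stated assumptions, not the structure used by any particular recognizer: the `Node` class, the `paths` and `n_best` helpers, the sample probabilities, and the rule that a path's probability is the product of its node probabilities are all hypothetical choices made for the example.

```python
from dataclasses import dataclass, field
from heapq import nlargest

@dataclass
class Node:
    text: str                                  # word or phrase held by the node
    prob: float                                # probability assigned to it
    next: list = field(default_factory=list)   # indices of successor nodes

def paths(lattice, starts):
    """Yield (text, probability) for every start-to-end path (assumed product rule)."""
    stack = [(i, [lattice[i].text], lattice[i].prob) for i in starts]
    while stack:
        i, words, p = stack.pop()
        if not lattice[i].next:                # terminal node: one full alternative
            yield " ".join(words), p
        for j in lattice[i].next:
            stack.append((j, words + [lattice[j].text], p * lattice[j].prob))

def n_best(lattice, starts, n):
    """Reduce a lattice to its n most probable alternatives."""
    return nlargest(n, paths(lattice, starts), key=lambda alt: alt[1])

# Hypothetical lattice for one handwritten phrase.
lattice = [Node("TOWN", 0.5, [2]), Node("GOWN", 0.375, [2]),
           Node("OF", 1.0), Node("THOUSAND", 0.125)]
top_two = n_best(lattice, starts=[0, 1, 3], n=2)
```

Here `top_two` holds the two most probable phrases, which is the kind of n-best alternatives list that paragraph [0050] says may be produced from a lattice.
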
[0051] The speech recognition program module 
240 works like the handwriting recognition program 
module 230, except that it receives speech input 290 
from the user through a microphone 161 run by a microphone
driver module in the operating system 135.
Speech is often difficult to interpret because many 
words that sound alike have different meanings and 
spellings, so the speech recognition program module 
240 also produces a stochastic result. The stochastic 
model 270b stores the data structure containing the sto- 
chastic data produced by processing speech input 290 
with the speech recognition program module 240. 
[0052] An input method editor (IME) 250 also generates
stochastic data. Generally, an IME 250 converts
input into foreign language text. The input into an IME 
250 may, for example, be typed text entered into the 
computer through a keyboard 140 and mouse 142. The 
stochastic model 270c includes a data structure con- 
taining the stochastic data produced by the IME 250. 
[0053] An IME 250 is especially useful for creating 
ideograms in Asian and other languages. Because 
there are many more ideograms in such languages than 
there are keys on a keyboard, entering a particular ide- 
ogram into the computer is problematic without an IME 
250. In a typical IME 250, a user types in English characters
a phonetic spelling for a desired Chinese character.
Since many Chinese characters have similar
pronunciations, the typed phonetic spellings may represent
any one of a number of different Chinese characters,
and the IME 250 produces a stochastic result. The
IME 250 then provides the user with the most probable
candidates intended by the typed phonetic spelling so
that the user can choose the correct one.
[0054] The stochastic results produced by one stochastic
input source may serve as stochastic input to a
second stochastic input source. When this is the case,
the stochastic input sources are "series stochastic input
sources," and the stochastic input sources can be
described as configured "in series." This is illustrated by
the configuration 293 of program modules, which also
demonstrates another embodiment of an IME 250.
[0055] In this embodiment, English speech input
262 may be entered into the computer and used to produce
Japanese text. The speech 262 is first submitted
to a speech recognition program module 260. In operation,
the speech recognition program module 260 functions
much like the speech recognition program module
240, but it is illustrated as a distinct unit because it may
have a different speech interpretation engine. For example,
the speech recognition program module 260 may
interpret a different language than the speech recognition
program module 240. The stochastic model 270d
includes a data structure containing the stochastic data
produced by processing speech input with the speech
recognition program module 260.
[0056] In an English speech/Japanese IME example,
the speech recognition program module 260 may
produce English text alternatives from the spoken English
words and store them in the stochastic model 270d.
One or more of the English language text alternatives
stored in the stochastic model 270d can then be used
as input into the IME 250, which translates the English
language text input into Japanese characters. Each
alternative input into the IME 250 produces a separate
stochastic result, though it should be appreciated that
there may be overlap between the alternatives forming
the stochastic results of two distinct inputs into the IME
250.
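
One way to picture how results from sources configured in series might be merged is sketched below. It is an assumption made for illustration, not a procedure stated in this section: each final alternative is given the sum, over every intermediate alternative that can produce it, of the intermediate probability multiplied by the second-stage probability. The function name `series_model`, the placeholder outputs `J1` to `J3`, and all probabilities are hypothetical.

```python
from collections import defaultdict

def series_model(first_stage, second_stage):
    """Merge two stochastic stages into one list of (text, probability).

    first_stage:  list of (text, prob) from the first source (e.g. speech).
    second_stage: function mapping a first-stage text to a list of
                  (text, prob) alternatives from the second source (e.g. IME).
    """
    totals = defaultdict(float)
    for mid_text, p_mid in first_stage:
        for out_text, p_out in second_stage(mid_text):
            # Overlapping outputs from different inputs accumulate probability.
            totals[out_text] += p_mid * p_out
    return sorted(totals.items(), key=lambda alt: alt[1], reverse=True)

# Hypothetical data: two English alternatives, each converted by a toy IME table.
speech = [("key", 0.75), ("tea", 0.25)]
ime_table = {"key": [("J1", 0.5), ("J2", 0.5)],
             "tea": [("J2", 0.5), ("J3", 0.5)]}
combined = series_model(speech, ime_table.__getitem__)
```

In this toy data, `J2` is reachable from both English alternatives, so it ends up ranked first even though neither input favors it on its own. That is the overlap case mentioned in paragraph [0056].
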

[0057] Though the arrow in FIG. 2 from the speech
recognition program module 260 to IME 250 illustrates
that the speech recognition program module is a stochastic
input source for the IME 250, it should be understood
that the two program modules may not interface
directly. Thus, for example, stochastic input from
speech recognition program module 260 to IME 250
may travel through an interface program module, such
as stochastic input interface 139, to which each stochastic
input source is directly connected.
[0058] A stochastic input interface 139 serves as a
conduit for stochastic data between an application 210
that is to receive stochastic data and a stochastic input
source, such as handwriting recognition program module
230, speech recognition program module 240, or
IME 250. One advantage of having a stochastic input
interface 139 as a conduit for stochastic data is that it
simplifies communication between the application 210
receiving the stochastic data and the stochastic input
sources. That is, the application only needs to know
how to communicate with the stochastic input interface
instead of all possible stochastic input sources. The
application 210 that is to receive stochastic input is a
word processor in an exemplary embodiment of the
present invention. However, the application 210 could
also be a spreadsheet, browser, electronic mail program,
music transcription program, CAD program, presentation
software (such as PowerPoint, produced by
Microsoft Corporation of Redmond, Washington), operating
system, or other software program.
[0059] In the word processor embodiment, the word
processor 210 receives, through stochastic input interface
139, text representing the most likely alternative
from each stochastic input source used to enter data
into the word processor. In addition to transmitting data
into the word processor 210 through multiple stochastic
input sources, the user may also enter typical non-stochastic
data into the word processor, such as by typing
on a keyboard 140. The word processor 210 combines
all this source data into a multi-source text string that is
presented to the user. Although the word processor 210
does not indicate to the user the source of each word of
the text, the word processor nonetheless maintains a
record of the source of each component of the text.
[0060] The word processor 210 is also functional for
allowing the user to identify a section of text and to 
request alternatives for that selection. If the text selec- 
tion is derived from one or more stochastic input 
sources, there will be alternatives for the text selection. 
The word processor 210 can request a candidate list of 
alternatives from the stochastic input interface 139 by 
providing it with the text selection and the sources of 
each of the components of that text selection. After the 
request is processed, the stochastic input interface 139 
provides the word processor 210 with a candidate list for
the entire text selection. The word processor 210 pro- 
vides the candidate list to the user through a graphical 
user interface. If the user chooses one of the alterna- 
tives for the text selection from the candidate list, then 
the word processor replaces the text selection with the 
chosen candidate. 

[0061] In order to process the request for a candi- 
date list of alternatives for a text selection, the stochas- 
tic input interface 139 transmits the request to the 
stochastic input combiner 137. By communicating with 
the stochastic input sources through the stochastic 
input interface 139, the stochastic input combiner 137 
can retrieve information about the stochastic models
270 needed to produce the candidate list for the text 
selection. 

[0062] To produce the candidate list, the stochastic 
input combiner 137 may optionally consult a natural lan- 
guage model 220. To do so, the combiner 137 first pro- 
duces an interim candidate list of alternatives for the 
text selection using the information retrieved from the
stochastic models 270. After the combiner 137 provides
the interim candidate list to the natural language model
220, the natural language model analyzes the interim
candidate list using clues such as grammar, the overall
meaning of a section of text, and the probability of various
word sequences. Based upon this analysis, the natural
language model 220 produces additional
alternatives for the candidate list and reevaluates the
probabilities of those alternatives in the interim candidate
list. The methods used to produce a candidate list
of alternatives for a text selection will be described with 
reference to FIGS. 3-9. 

[0063] As shown in FIG. 2, stochastic input sources
230, 240, and 250 can each provide stochastic data to
word processor 210 without first filtering their stochastic
data through another stochastic input source. In other
words, stochastic input sources 230, 240, and 250 can
each directly (through stochastic input interface 139)
transmit stochastic data to the word processor 210, and
stochastic data from each source can be incorporated
into the same word processing document. For this reason,
they are "parallel stochastic input sources" 296,
and these stochastic input sources may be described as
configured "in parallel."

[0064] Although the various program modules have
been described separately, one skilled in the art should
recognize that the modules could be combined in various
ways and that new program modules could be created
to accomplish similar results. In particular, the
stochastic input combiner 137 and the natural language
model 220 could reside in the stochastic input interface
139, and all three program modules could be part of the
operating system 135 or the word processor 210. Also,
the combiner 137 and the natural language model 220
could be separate programs that interface with the word
processor 210 directly. Similarly, the stochastic input
sources 230, 240, 250, and 260 could be stand-alone
application program modules 138, or they could be part
of the operating system 135.

Graphical Illustration of a Typical Implementation 

[0065] FIGS. 3-5 illustrate what the user sees and 
does in a typical implementation of the present invention.
Furthermore, these figures show the functionality
of the stochastic input combiner 137 and the natural lan- 
guage model 220. 

[0066] In FIG. 3, a computer 120 with multiple text
entry methods accepts input from a user and transmits
that input to an application 210, such as a word processor.
The computer converts that input into a text string
300, which it displays on monitor 147. In this example,
the user intended to produce the text "THIS IS A MESSAGE
WRITTEN BY A THOUSAND MONKEYS TYPING
AT RANDOM." However, the computer interpreted
the stochastic input as "THIS IS A MESSAGE WRITTEN
BY A TOWN OF MY KEYS TAIPING AT RANDOM"
to produce text 300.
[0067] Once the text 300 is displayed, the user may
make a text selection 310 by highlighting a portion of the
text. The text selection 310 in FIG. 3 has three text components
312, 314, and 316. Each of the text components
312, 314, and 316 comes from a different
stochastic input source. Thus, for example, text component
312 may be one of the alternatives produced by
processing handwriting input 280 with the handwriting
recognition program module 230. The alternatives produced
by this stochastic input source are stored in the
stochastic model 270a. The alternatives list 318 is a list
of the alternatives for text component 312 stored in the
stochastic model 270a. Furthermore, the computer has
chosen "TOWN OF" for text component 312 because
the computer determined that that is the most likely
alternative in alternatives list 318 for the handwriting
input 280 that was used to produce the text component
312. As shown by alternatives list 318, however, the
computer also recognizes that "GOWN OF" and
"THOUSAND" are possible alternative phrases for text
component 312.

[0068] Similarly, speech recognition program mod- 
ule 240 produces the stochastic result stored in sto- 
chastic model 270b by processing speech input 290. 
The alternatives list 320 contains the alternatives stored 
in stochastic model 270b for text component 314. "MY
KEYS" has been selected for text component 314
because it is the most likely alternative. 
[0069] Likewise, text component 316, "TAIPING,"
comes from yet a third stochastic input source. This sto- 
chastic input source stores its alternatives in stochastic 
model 270c, and the alternatives are represented in list 
form in alternatives list 322. 

[0070] The stochastic input combiner 137 forms
various combinations of alternatives from alternatives
lists 318, 320, and 322. The stochastic input combiner
137 then produces a ranked list of the various combina- 
tions it has produced based on its calculation of the 
probability that each combination is the one intended by 
the user for text selection 310. The top ranked alterna- 
tives for text selection 310 are then displayed in alterna- 
tives list 330 on the monitor 147. 
[0071] After the alternatives list 330 for the text
selection is displayed, the user may choose to edit the
text selection, as shown in FIG. 4A. In that figure, the
user has made edit 410 by typing the word "THOUSAND"
over the words "TOWN OF" from FIG. 3. As a
result, text component 312 of text selection 310 is
replaced with text component 312'.
[0072] Edit 410 may have been accomplished by 
using a keyboard 140 and a mouse 142. Because such 
an entry method is non-stochastic in nature, there are 
no alternatives for text component 312'. This change is
reflected in alternatives list 318', which has replaced 
alternatives list 318 from FIG. 3. The only alternative 
shown in alternatives list 318' is "THOUSAND." 
[0073] After the edit is completed, stochastic input
combiner 137 again forms various combinations of
alternatives from alternatives lists 318', 320, and 322 in 
order to form alternatives for the edited text selection 
310. These alternatives are displayed as alternatives list 
430 on monitor 147. 

[0074] If the user desires a different alternative to
replace text selection 310 than the alternatives displayed
in alternatives list 430, the user may again edit
text selection 310. This is shown in FIG. 4B, with the
user making edit 412 by typing the word "MONKEYS"
over the phrase "MY KEYS" that appeared in text selection
310 in FIG. 4A. As a result, text component 314',
"MONKEYS," replaces text component 314, "MY
KEYS." This further results in the replacement of alternatives
list 320 with alternatives list 320', which contains
only one alternative because an edit is non-stochastic in
nature. Once again, the stochastic input combiner 137
combines alternatives from the various alternatives lists
318', 320', and 322 in order to produce alternatives list
430' for text selection 310.

[0075] At that point, the user may find an alternative 
414 with which he wishes to replace text selection 310. If so,
he may highlight and choose alternative 414, in which 
case alternative 414 replaces the text selection 310 to 
produce new text 416. 

[0076] FIG. 5 is similar to FIG. 3, except that FIG. 5
illustrates an embodiment of the invention in which the
natural language model 220 is operative. As in FIG. 3, a
user makes text selection 310 comprised of text components
312, 314, and 316. The stochastic input combiner
137 then forms various combinations of alternatives for
the text components to produce an interim list of alter- 
natives for text selection 310. Instead of displaying the 
most probable alternatives thus derived, the stochastic 
input combiner 137 then passes the interim list of alter- 
natives for the text selection 310 to a natural language 
model 220. 

[0077] The natural language model 220 then re- 
evaluates the probabilities of the various alternatives in 
the interim alternatives list based on natural language 
principles applied by the natural language model to the 
text selection 310 as a whole. This includes analysis of 
grammatical and other language clues in the text selection.
The natural language model 220 may also form
additional alternatives for the text selection 310 not 
found in the interim list of alternatives provided to the 
natural language model 220. The natural language 
model 220 returns to the stochastic input combiner 137 
a revised list of alternatives based on the re-evaluated 
alternatives in the interim alternatives list and the addi- 
tional alternatives it has produced. The stochastic input 
combiner 137 then chooses the top ranked alternatives 
for display in alternatives list 530. 
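
The re-evaluation step can be illustrated with a deliberately simple stand-in for a natural language model. Everything in this sketch is an assumption made for the example: the word-pair scores, the default score for unseen pairs, the renormalization, and the function name `rescore`. It only re-ranks the interim list and does not attempt to form additional alternatives.

```python
def rescore(interim, pair_score, default=0.1):
    """Re-rank (text, prob) alternatives using hypothetical word-pair scores."""
    revised = []
    for text, prob in interim:
        words = text.split()
        language_score = 1.0
        for pair in zip(words, words[1:]):      # adjacent word pairs
            language_score *= pair_score.get(pair, default)
        revised.append((text, prob * language_score))
    total = sum(p for _, p in revised)
    revised = [(t, p / total) for t, p in revised]   # renormalize to sum to 1
    return sorted(revised, key=lambda alt: alt[1], reverse=True)

# Hypothetical interim list and pair scores.
interim = [("TOWN OF MY KEYS", 0.5), ("THOUSAND MONKEYS", 0.25)]
scores = {("THOUSAND", "MONKEYS"): 0.9}
revised = rescore(interim, scores)
```

With these made-up scores, the phrase that reads naturally as a whole overtakes the phrase whose individual components were each more probable.
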
[0078] Because of the operation of the natural language
model, the alternatives list 530 is hopefully better
than alternatives list 330 of FIG. 3. If that is the case, the
user may choose an alternative 502 from alternatives
list 530 without having to edit the text selection 310. In
the example shown in FIG. 5, new text 504 is formed by
the user replacing text selection 310 with alternative
502.

Flow Charts for a Typical Implementation

[0079] FIG. 6 is a flow chart of the steps in a typical
routine 600 for processing source data. This routine
illustrates the steps that implement the embodiments of
the invention described with respect to FIGS. 3-5. The
routine 600 begins at step 602 with the word processor
receiving source data from multiple input sources. An
input source may be a stochastic input source, such as
handwriting recognition program module 230, speech
recognition program module 240, or input method editor
250. An input source could also be non-stochastic, such
as typed data entered using a keyboard 140 and mouse
142. Furthermore, some of the source data may come
from two or more stochastic input sources configured in
series. If data is derived from stochastic input sources
configured in series, each stochastic input source may
be counted as a different input source.
[0080] After the word processor receives source
data from multiple input sources, the word processor
combines that data into a multi-source text string in step
604. This means that the word processor creates text
corresponding to the source data so that the word processor
can display the text on the monitor 147. Furthermore,
the word processor creates a data structure to keep
track of the source of each word of the text.

[0081] In step 606, a user may make a text selection
including a portion of the displayed text. This text
selection may include text from multiple input sources.
The user may, for example, make the text selection by
depressing a mouse button at the beginning of the text
selection, dragging the mouse to the end of the desired
text selection, and then releasing the mouse button.
Preferably, the word processor highlights the text selection
to indicate what has been selected.
[0082] In step 608, the word processor receives a
"display alternatives" command for the text selection. In
response, the word processor determines alternatives
for the multi-source text selection in step 610.
[0083] In step 612, the word processor displays
those alternatives on the monitor 147. The word processor
preferably displays the alternatives in probabilistic
order through a graphical user interface that allows the
user to make a selection from the displayed alternatives.
The graphical user interface may appear in a sub-window
that the user can move around so as to reveal
text hidden by the sub-window.

[0084] In step 614, the user gives the word processor
a command. Examples of possible commands
include the selection of a displayed alternative, an
attempt to edit the text selection, or an attempt to make
a new text selection by depressing the mouse button to
anchor the mouse at a point of the text outside of the
text selection.

[0085] In step 616, the word processor determines
if the user has selected a displayed alternative. If the
user has selected a displayed alternative, then step 618 
is performed, in which the word processor replaces the
text selection with the selected alternative. After step 
618, the word processor discontinues the display of 
alternatives in step 624 before the routine ends at step 
626. After step 626, the routine may be repeated by a 
return to step 602. 

[0086] Returning to step 616, if the user has not
selected a displayed alternative, then step 620 is performed.
In step 620, the word processor determines if
the user is editing the text within the text selection. If the
user is editing the text within the text selection, then the
word processor processes that edit in step 622. After
the word processor completes the edit, the routine loops
back to step 610 to determine new alternatives for the
edited text selection.

[0087] Returning to step 620, if the user command 
received in step 614 was not an edit command within 
the text selection, then the word processor performs 
step 624. In this case, the user has initiated the creation 
of a new text selection by depressing the mouse outside 
of the old text selection. Hence, the word processor dis- 
continues the display of alternatives in step 624 before 
the routine ends at step 626. Once again, the routine 
may be repeated by returning to step 602. 
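
The branching in steps 610 through 626 can be summarized as a small control loop. The sketch below is only a reading aid under stated assumptions: the command tuples, the function name `handle_commands`, and the `determine_alternatives` callback are hypothetical, and display handling is reduced to comments.

```python
def handle_commands(selection, commands, determine_alternatives):
    """Return the text that ends up in place of the selection.

    commands: iterable of ("choose", index), ("edit", new_text),
              or ("new_selection", None) tuples (hypothetical encoding).
    """
    alternatives = determine_alternatives(selection)           # step 610
    for kind, arg in commands:                                  # step 614
        if kind == "choose":                                    # step 616
            return alternatives[arg]                            # step 618, then 624 and 626
        if kind == "edit":                                      # step 620
            selection = arg                                     # step 622
            alternatives = determine_alternatives(selection)    # loop back to step 610
            continue
        break                                                   # step 624: stop displaying
    return selection                                            # step 626

# Toy stand-in for routine 610: offer upper- and lower-case variants.
toy_alternatives = lambda text: [text.upper(), text.lower()]
result = handle_commands("Ab", [("edit", "Cd"), ("choose", 1)], toy_alternatives)
```
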
[0088] FIG. 7 details the steps of routine 610 from 
FIG. 6. The routine describes the steps for determining 
alternatives for a multi-source text selection. Typically, a 
stochastic input combiner performs this routine. This 
combiner may be a program module in the word proces- 
sor, a separate utility in the operating system, or a sep- 
arate program that interfaces with the word processor. 
[0089] The routine begins with step 702, in which
the stochastic input combiner parses the text selection
into text components originating from different input
sources. To do so, the stochastic input combiner consults
the data structure that stores the source of each
word in the text string displayed on the monitor 147. 
Parsing the text selection into text components makes 
the process of determining alternatives for the text 
selection more manageable. 
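
A minimal sketch of the parsing step, assuming the source record is simply a list of (word, source) pairs in display order: adjacent words with the same source are grouped into one text component. The source labels and the function name `parse_components` are hypothetical, and the sketch does not handle typed edits that are folded into a stochastic component.

```python
from itertools import groupby

def parse_components(words):
    """Group adjacent (word, source) pairs into (text, source) components."""
    return [(" ".join(word for word, _ in group), source)
            for source, group in groupby(words, key=lambda pair: pair[1])]

# Hypothetical source record for the text selection of FIG. 3.
record = [("TOWN", "handwriting"), ("OF", "handwriting"),
          ("MY", "speech"), ("KEYS", "speech"),
          ("TAIPING", "ime")]
components = parse_components(record)
```
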

[0090] One skilled in the art should recognize that 
multiple definitions of a text component are possible. 
Using a different definition than the embodiment 
described in FIGS. 7-9 will require a different parsing 
step 702, as well as other appropriate modifications of 
the routines described therein. A text component could,
for example, be a single word. Or, a text component
could be a phrase composed of words originating from
the same input source. In the latter case, a phrase
derived from a stochastic input source could be a different
text component than an edit inserted into the middle
of that phrase. 

[0091] FIGS. 7-9 illustrate an example in which a
text component is defined as the largest unit of text
derived from a different input source or series of stochastic
input sources than its neighbors, together with
any typed text that has edited that text component. A
unit of typed text that is not an edit of a text component
originating from a stochastic input source is considered
its own text component. For example, typed text a user
has inserted between text components originating from
different stochastic input sources is considered its own
text component.
[0092] In step 704, the stochastic input combiner
retrieves a stochastic model for each text component
originating from a stochastic input source. In step 706,
the stochastic input combiner determines if all stochastic
models are lattices. If all stochastic models are lattices,
then step 708 is performed.
[0093] In step 708, the stochastic input combiner
concatenates all the lattices retrieved into a metalattice.
In order to create the metalattice, the stochastic input
combiner creates nodes to incorporate any typed text
components for typed text that is not incorporated into a
text component from a stochastic input source. Using
lattice traversal methods well known to those skilled in
the art, the stochastic input combiner traverses the metalattice
to form a list of alternatives for the text selection
in step 710. After step 710, control passes to step 716,
which will be discussed after the "NO" path from step
706 is described.
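
Steps 708 and 710 might look like the following sketch. The representation is an assumption chosen for brevity: a lattice is a pair `(nodes, starts)`, where each node is `(text, probability, successor_indices)`. The names `concatenate`, `typed_text`, and `traverse`, the sample probabilities, and the product rule for path probabilities are all hypothetical.

```python
def typed_text(text):
    """Wrap non-stochastic text as a one-node lattice with probability 1.0."""
    return ([(text, 1.0, [])], [0])

def concatenate(lattices):
    """Join lattices end to start into one metalattice (nodes, starts)."""
    nodes, starts, open_ends = [], [], None
    for lat_nodes, lat_starts in lattices:
        offset = len(nodes)
        new_starts = [s + offset for s in lat_starts]
        if open_ends is None:
            starts = new_starts                  # first lattice supplies the starts
        else:
            for i in open_ends:                  # link previous ends to new starts
                nodes[i][2].extend(new_starts)
        open_ends = []
        for k, (text, prob, succ) in enumerate(lat_nodes):
            nodes.append([text, prob, [j + offset for j in succ]])
            if not succ:
                open_ends.append(offset + k)
    return nodes, starts

def traverse(nodes, starts):
    """List every path as (text, probability), most probable first."""
    found = []
    stack = [(i, [nodes[i][0]], nodes[i][1]) for i in starts]
    while stack:
        i, words, p = stack.pop()
        if not nodes[i][2]:
            found.append((" ".join(words), p))
        for j in nodes[i][2]:
            stack.append((j, words + [nodes[j][0]], p * nodes[j][1]))
    return sorted(found, key=lambda alt: alt[1], reverse=True)

# Hypothetical components: handwriting lattice, typed edit, IME lattice.
first = ([("TOWN OF", 0.75, []), ("THOUSAND", 0.25, [])], [0, 1])
third = ([("TAIPING", 0.5, []), ("TYPING", 0.5, [])], [0, 1])
meta = concatenate([first, typed_text("MONKEYS"), third])
alternatives = traverse(*meta)
```

Treating typed text as a single node with probability 1.0 lets the same traversal handle stochastic and non-stochastic components uniformly.
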

[0094] Returning to step 706, if not all of the stochastic
models retrieved in step 704 are lattices, then the stochastic
input combiner performs step 712. In this case,
at least one of the stochastic models is an "n-best" candidate
list. So, the stochastic input combiner converts
each of the lattices into an "n-best" candidate list. This
is necessary for the stochastic input combiner to perform
step 714.

[0095] In step 714, the stochastic input combiner
combines the "n-best" candidate list for each of the text
components with typed text components in order to
form a combined list of alternatives for the text selection.
The stochastic input combiner does this by forming all
possible combinations of alternatives, one from each "n-best"
candidate list for a text component. For each combination,
the stochastic input combiner arranges the
alternatives for the text components in the order in
which the text components appear in the text selection.
The list of all arrangements thus formed comprises the
list of alternatives for the text selection. After step 714,
control passes to step 716.
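
Step 714 amounts to a Cartesian product over the per-component candidate lists. The sketch below assumes, purely for illustration, that components are independent so a combination's probability is the product of its parts, and that typed text is represented as a one-entry list with probability 1.0. The function name `combine` and the sample probabilities are hypothetical.

```python
from itertools import product
from math import prod

def combine(component_lists):
    """Form every combination of alternatives, one per component, in text order.

    component_lists: one list of (text, prob) per text component.
    """
    combos = []
    for choice in product(*component_lists):
        text = " ".join(t for t, _ in choice)
        combos.append((text, prod(p for _, p in choice)))   # assumed independence
    return sorted(combos, key=lambda alt: alt[1], reverse=True)

# Hypothetical candidate lists; the middle component is a typed edit.
lists = [
    [("TOWN OF", 0.5), ("GOWN OF", 0.25), ("THOUSAND", 0.25)],
    [("MONKEYS", 1.0)],
    [("TAIPING", 0.75), ("TYPING", 0.25)],
]
ranked = combine(lists)
```

The number of combinations grows as the product of the list lengths, which is one practical reason to keep each n-best list short.
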

[0096] Once control passes to step 716 from either
710 or 714, the stochastic input combiner has formed a
list of alternatives for the text selection. Step 716 is an
optional step in which a natural language model is operative.
If this step is performed, the stochastic input combiner
submits the list of alternatives for the text selection
to the natural language model.
[0097] If optional step 716 is performed, then
optional step 718 is also performed. In that step, the
natural language model returns a ranked list of revised
alternatives to the stochastic input combiner. The
revised list includes a re-evaluation of the probabilities
of the alternatives in the list of alternatives submitted to
the natural language model. The revised list may also
contain new alternatives for the text selection that the
natural language model formed. The natural language
model creates this revised list using natural language
principles, including an analysis of grammatical and
other language clues that the natural language model
applies to the text selection as a whole.
[0098] In step 720, the stochastic input combiner
chooses the top ranked alternatives for display. If
optional steps 716 and 718 were performed, then the
stochastic input combiner chooses these alternatives
from the revised list received from the natural language
model. If not, then the stochastic input combiner
chooses these alternatives from the list of alternatives
that was created by either step 710 or step 714. After
step 720, the routine ends at step 722.
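
The selection made in step 720 might be sketched in Python as follows. The function name `choose_for_display`, the `limit` parameter, and the (text, probability) representation are assumptions made for illustration; the description does not specify how many alternatives are displayed.

```python
def choose_for_display(combined, revised=None, limit=3):
    """Pick the top-ranked alternatives for display.

    combined: list produced by step 710 or step 714.
    revised: list returned by the natural language model, or None
             when optional steps 716 and 718 were skipped.
    """
    source = revised if revised is not None else combined
    ranked = sorted(source, key=lambda item: item[1], reverse=True)
    return [text for text, _ in ranked[:limit]]


combined = [("a", 0.25), ("b", 0.5), ("c", 0.125), ("d", 0.125)]
revised = [("b", 0.375), ("e", 0.5), ("a", 0.125)]

without_language_model = choose_for_display(combined, limit=2)
with_language_model = choose_for_display(combined, revised, limit=2)
```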
[0099] FIG. 8 shows the steps of routine 704 on
FIG. 7. The routine illustrates the steps the stochastic
input combiner follows to retrieve a stochastic model for
each text component of the text selection originating
from a stochastic input source. The routine begins at
step 802, which effectively forms a loop for processing
each text component. In step 802, the stochastic input
combiner retrieves a text component. In step 804, the
stochastic input combiner determines if that text
component is a stochastic text component. If the retrieved
component is not a stochastic text component, then
step 806 is performed. In this case, the text component
typically comprises typed text entered using a keyboard
and mouse. Because the text component is
non-stochastic, the stochastic input combiner assigns the text
component a 100% probability. The stochastic input
combiner then performs step 818, which shall be
discussed shortly.
[0100] Returning to step 804, if the stochastic input
combiner determines that the text component retrieved
in step 802 is stochastic, then the stochastic input
combiner performs step 808. In step 808, the stochastic
input combiner determines if the text component is
derived from stochastic models configured in series. If
the text component is derived from a series of stochastic
input sources, then the stochastic input combiner
performs routine 810 in order to derive a series stochastic
model that accurately represents the probabilities of
the results produced by the last stochastic input source
of the series. After routine 810, the stochastic input
combiner performs step 812. Likewise, if the stochastic
input combiner determines in step 808 that the text
component retrieved in step 802 is not derived from a
series of stochastic models, step 812 is performed.
[0101] In step 812, the stochastic input combiner
determines if the user has edited the text component
using a keyboard and mouse. If the text component has
been edited, then the stochastic input combiner updates
the corresponding stochastic model in step 814. If the
stochastic model is a lattice, then updating it will include
deleting any nodes corresponding to words that have
been deleted from the text component. Furthermore,
the stochastic input combiner must add nodes for new
words within the text component. Similarly, if the
stochastic model is an "n-best" candidate list, the stochastic
input combiner must update each alternative of the
list to remove words that have been deleted from the
text component and add words that have been inserted
into the text component.

[0102] After step 814, the stochastic input combiner
performs step 816. The stochastic input combiner also
performs step 816 if it determines in step 812 that the
user has not edited the text component. In step 816, the
stochastic input combiner retrieves a stochastic model
result for the text component that was selected in step
802. If the text component was derived from a series of
stochastic models, then the stochastic model retrieved
is the series stochastic model produced in step 810 or,
if the text component has been edited, the series
stochastic model that was updated in step 814. The
stochastic model retrieved may be a lattice or an "n-best"
candidate list. The stochastic model retrieved need only
contain information about the selected text component,
so the stochastic input combiner may retrieve the
stochastic model from a larger stochastic model for a
selection of text that includes the text component.
[0103] The text component that was selected in
step 802 may be derived from stochastic input, but a
stochastic model representing alternatives for that text
component may be unavailable. In that case, the text
component can be treated the same as a non-stochastic
text component. In other words, the stochastic input
combiner assigns the known alternative for the text
component a probability of 100%. After step 816, the
stochastic input combiner performs step 818.
[0104] Step 818 can be reached from either step 
816 or step 806. In this step, the stochastic input com- 
biner determines if there are any more text components 
in the text selection to process. If there are any more 
text components, then the routine loops to step 802 so 
the stochastic input combiner can get and process the 
next text component. 
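
The loop of FIG. 8 might be sketched in Python as follows. The `TextComponent` class, its field names, and the function `retrieve_models` are hypothetical names introduced for illustration. The sketch covers only steps 802 through 806, 816, and 818; it omits the series derivation of routine 810 and the edit handling of steps 812 and 814, and it represents every stochastic model as an "n-best" list of (text, probability) pairs.

```python
from dataclasses import dataclass, field


@dataclass
class TextComponent:
    text: str
    stochastic: bool = False
    # "n-best" alternatives as (text, probability) pairs; left empty
    # when no stochastic model is available for the component.
    model: list = field(default_factory=list)


def retrieve_models(components):
    """Return one candidate list per text component, in order."""
    models = []
    for component in components:                      # steps 802 and 818
        if component.stochastic and component.model:  # steps 804 and 816
            models.append(list(component.model))
        else:
            # Step 806, or a stochastic component whose model is
            # unavailable: the known text gets a 100% probability.
            models.append([(component.text, 1.0)])
    return models


components = [
    TextComponent("I typed"),
    TextComponent("recognize speech", True,
                  [("recognize speech", 0.75), ("wreck a nice beach", 0.25)]),
    TextComponent("dictated", True),  # stochastic, but no model available
]
models = retrieve_models(components)
```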

[0105] When there are no more text components to
process in step 818, the stochastic input combiner
optionally performs step 820 for incorporating the natural
language model. In this step, the stochastic input
combiner passes each stochastic model retrieved for a
text component to the natural language model. The natural
language model applies natural language principles
to the text components and returns them to the stochastic
input combiner. Because the natural language model
operates on individual text components in step 820,
instead of on the entire text selection, step 820 may be
performed either instead of steps 716 and 718, or in
addition to those steps. After step 820, the routine ends
at step 822.

[0106] FIG. 9 illustrates the steps of routine 810 on
FIG. 8. This routine describes the steps the stochastic
input combiner follows to derive a series stochastic
model for a text component produced by stochastic
input sources configured in series.
[0107] The routine 810 begins with step 902, which
effectively begins a loop for processing in series order
each of the stochastic input sources, except the last
stochastic input source of the series. The first time the
stochastic input combiner performs step 902, the
stochastic input combiner selects the first stochastic
input source in series order. This is the stochastic input
source that receives the user input that ultimately
produces the text component.

[0108] Because a stochastic input source produces
multiple alternative results, the first stochastic input
source produces multiple candidates for input into the
second stochastic input source in the series. If the
stochastic input combiner is not performing step 902 for the
first time, then the stochastic input combiner will have
produced a totalized candidate list in step 914 (to be
described shortly) for the stochastic input source
selected in step 902. In the latter case, the totalized
candidate list contains the alternatives associated with
the selected stochastic input source that are to be used
as input into the subsequent stochastic input source of
the series. Step 904 effectively begins a loop for
processing all of the candidates associated with the
selected stochastic input source. In step 904, the
stochastic input combiner retrieves one of the candidates
for the selected input source.

[0109] In step 906, the stochastic input combiner
submits the candidate retrieved in step 904 as input into
the subsequent stochastic input source in series order.
Inputting this candidate into the subsequent stochastic
input source produces a stochastic result because the
subsequent source is also stochastic. The stochastic
input combiner retrieves this stochastic result.
[0110] In step 908, the stochastic input combiner
determines if the stochastic result retrieved in step 906
is a lattice. If the stochastic result retrieved in step 906
is not a lattice, then it is a ranked candidate list and step
912 (to be discussed shortly) is performed. If the
stochastic result retrieved in step 906 is a lattice, then the
stochastic input combiner must convert the lattice into a
ranked candidate list of alternatives, with each alternative
having an associated probability. This is done in
step 910 before control passes to step 912.
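
The conversion of a lattice into a ranked candidate list might be sketched in Python as follows. The lattice representation used here (a dictionary mapping each node to its outgoing (word, probability, next node) edges), the function name `lattice_to_ranked_list`, and the multiplication of edge probabilities along a path are assumptions of this sketch; the description does not fix a lattice format. The sketch assumes an acyclic lattice and enumerates every path, which a practical implementation would likely prune.

```python
def lattice_to_ranked_list(lattice, start, end):
    """Enumerate every path from start to end as a ranked alternative.

    lattice: dict mapping node -> list of (word, probability, next_node).
    Returns (text, probability) pairs, most probable first.
    """
    results = []

    def walk(node, words, probability):
        if node == end:
            results.append((" ".join(words), probability))
            return
        for word, edge_probability, next_node in lattice.get(node, []):
            # Assumption: a path's probability is the product of its edges.
            walk(next_node, words + [word], probability * edge_probability)

    walk(start, [], 1.0)
    results.sort(key=lambda item: item[1], reverse=True)
    return results


lattice = {
    0: [("recognize", 0.5, 1), ("wreck a nice", 0.5, 2)],
    1: [("speech", 1.0, 3)],
    2: [("beach", 0.75, 3), ("speech", 0.25, 3)],
}
ranked = lattice_to_ranked_list(lattice, start=0, end=3)
```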
[0111] In step 912, the stochastic input combiner
determines if there is another candidate for the selected
source. If there is another candidate for the selected
source, then the routine loops back to step 904 so that
the stochastic input combiner can get the next
candidate. If there is not another candidate for the selected
source, then step 914 is performed.
[0112] In step 914, the stochastic input combiner
combines all the candidate lists produced by using
candidates from the input source selected in step 902 as
input into the subsequent stochastic input source in
series order. This combination forms a totalized candidate
list for the subsequent stochastic input source. The
stochastic input combiner forms the totalized candidate
list by making one entry for each unique candidate in
the candidate lists being combined. Then, the stochastic
input combiner calculates a probability for each
alternative in the totalized alternatives list by summing all
probabilities assigned to that alternative in each of the
candidate lists being combined. The stochastic input
combiner assigns each calculated probability to its
associated alternative.
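
The totalizing of step 914 might be sketched in Python as follows. The function name `totalize` and the (text, probability) representation are assumptions made for illustration. The sketch sums the probabilities exactly as they appear in the candidate lists; whether those probabilities have already been scaled by the probability of the candidate that produced each list is not stated in the paragraph above, and the sketch does not normalize the result.

```python
from collections import defaultdict


def totalize(candidate_lists):
    """Merge candidate lists into one entry per unique alternative.

    Each alternative's probability is the sum of the probabilities
    assigned to it across all of the lists being combined.
    """
    totals = defaultdict(float)
    for candidates in candidate_lists:
        for text, probability in candidates:
            totals[text] += probability
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)


candidate_lists = [
    [("A", 0.25), ("B", 0.125)],
    [("B", 0.25), ("C", 0.125)],
]
totalized = totalize(candidate_lists)
```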

[0113] In step 916, the stochastic input combiner
determines if there is another source in the series after
what is currently the subsequent source. If there is
another source in the series, then the stochastic input
combiner selects what is currently the subsequent
source in step 902, and the next source after what had
been the subsequent source becomes the subsequent
source. At this point, the stochastic input combiner
chooses candidates from the totalized candidate list for
the selected input source as input into the subsequent
stochastic input source.

[0114] Returning to step 916, if there is not another
source in the series after the subsequent source, then
the routine ends at step 918. The totalized candidate list
most recently created in step 914 is selected as the
series stochastic model.
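
Routine 810 as a whole might be sketched in Python as follows. The function name `derive_series_model` and the modelling of each later stochastic input source as a callable that maps a text to a ranked (text, probability) list are hypothetical. The sketch also multiplies each result's probability by the probability of the candidate that produced it before summing; that weighting is an assumption of this sketch and is not stated in the steps above. Lattice results (steps 908 and 910) are assumed to have been converted to ranked lists already.

```python
from collections import defaultdict


def derive_series_model(first_candidates, later_sources):
    """Derive a series stochastic model as a ranked candidate list.

    first_candidates: (text, probability) pairs from the first source.
    later_sources: callables in series order; each maps a text to a
                   ranked list of (text, probability) pairs.
    """
    candidates = first_candidates
    for source in later_sources:                       # steps 902 and 916
        totals = defaultdict(float)
        for text, input_probability in candidates:     # steps 904 and 912
            for result, probability in source(text):   # step 906
                # Step 914, with the assumed weighting by the input.
                totals[result] += input_probability * probability
        candidates = sorted(totals.items(),
                            key=lambda item: item[1], reverse=True)
    return candidates                                  # step 918


def toy_input_method_editor(text):
    """Hypothetical second source: maps a reading to candidate outputs."""
    table = {
        "hello": [("H1", 0.5), ("H2", 0.5)],
        "hollow": [("H2", 1.0)],
    }
    return table.get(text, [])


speech_candidates = [("hello", 0.75), ("hollow", 0.25)]
series_model = derive_series_model(speech_candidates,
                                   [toy_input_method_editor])
```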

[0115] FIG. 10 is a logical flow diagram illustrating 
typical steps of an alternative embodiment 1000 of a 
source data processing routine. Generally, the routine 
provides for automatically adjusting the unit of text
corrected in response to a user's text selection.
[0116] Routine 1000 begins with step 1005. In that
step, the word processor 210 receives a text selection
from the user which the user wants to correct. The user
may specify the text selection by selecting the word or
group of words comprising the text selection with the
mouse 142. Alternatively, the user may specify a text
selection consisting of a single word by using the mouse
142 to place the insertion point in or adjacent to the
word. The word processor 210 may then submit the text
selection to the stochastic input combiner 137 to
determine correction alternatives.

[0117] In step 1010, the stochastic input combiner
137 submits the text selection to a correction scope
model to determine if the scope of correction should be
adjusted. Typically, adjusting the scope of correction
involves identifying a text unit that will provide better text
correction alternatives to the user than the text selection
alone. For instance, the text selection may not include
neighboring words that also contain errors which could
be corrected together with the text selection. Usually,
such errors in words neighboring a user's text selection
are identifiable because they relate to errors in the text
selection.

[0118] Accordingly, a text unit identified by the
correction scope model may include the text selection plus
one or more adjacent words. Instead of identifying only
a single text unit for possible correction, the correction
scope model may identify multiple text units, each of
which is likely to yield good alternatives for text
correction.

[0119] In step 1015, the stochastic input combiner
137 receives from the correction scope model a list of
text units for which correction alternatives should be
provided to the user. If the correction scope model
determined that the scope of correction need not be
adjusted, then the list of text units includes only the text
selection. If the correction scope model identified only
one text unit for correction, the list of text units need
include only that one text unit.

[0120] Step 1020 effectively begins a loop for
processing each of the text units identified in the list of
text units that the correction scope model returned to
the stochastic input combiner 137 in step 1015. In step
1020, the combiner 137 selects a text unit for processing.
In step 1025, the combiner 137 performs the steps
of the routine of FIG. 7 in order to determine alternatives
for the selected text unit. One should understand that
when the combiner 137 performs the routine 1025 by
performing the steps described in FIG. 7, "text selection"
as used in FIG. 7 refers to the selected text unit.
[0121] In step 1030, the stochastic input combiner
137 determines if there are any more text units to
process. If there are more text units, the routine loops back
to step 1020 along the "YES" branch to process the next
text unit. If there are no more text units, the "NO" branch
is followed to step 1035.

[0122] In step 1035, the stochastic input combiner
137 provides each of the correction alternatives and
their associated text units to the word processor 210 for
display. The word processor 210 may display these
alternatives in any appropriate manner. If the scope of
correction was not adjusted, the alternatives may be
displayed as described with respect to FIG. 6. If the
combiner 137 expanded the scope of correction to a
single text unit, the word processor 210 may highlight
the additional words in the text to which the scope of
correction was expanded in a color different from the
color used to highlight the text selection, and the word
processor may present the alternatives for the text unit
in a typical graphical user interface as described with
respect to FIG. 6.

[0123] Suppose that the correction scope model
identified multiple text units for correction. In that case,
the word processor 210 may present the user with a
menu of alternatives and identify the corresponding text
unit for each alternative.

[0124] After the word processor 210 presents the
correction alternatives to the user through a graphical
user interface, the routine ends at step 1040. The word
processor 210 can then process the user's response to
the alternatives as described with respect to FIG. 6.
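
The flow of routine 1000 might be sketched in Python as follows. The function `correct_with_scope` and the two toy callables standing in for the correction scope model and for the routine of FIG. 7 are hypothetical; the description does not define their interfaces. The sketch returns each alternative paired with its text unit, which corresponds to what the combiner hands to the word processor in step 1035.

```python
def correct_with_scope(selection, scope_model, find_alternatives):
    """Collect correction alternatives for every text unit in scope.

    scope_model: callable returning the list of text units to correct
                 (just the selection when the scope is not adjusted).
    find_alternatives: callable standing in for the routine of FIG. 7.
    """
    results = []
    for unit in scope_model(selection):              # steps 1010 to 1020
        for alternative in find_alternatives(unit):  # routine 1025
            results.append((unit, alternative))
    return results                                   # provided in step 1035


def toy_scope_model(selection):
    """Hypothetical model: also proposes a wider unit around the selection."""
    return [selection, "wreck a nice beach"]


def toy_alternatives(unit):
    """Hypothetical stand-in for the routine of FIG. 7."""
    table = {
        "beach": ["speech", "peach"],
        "wreck a nice beach": ["recognize speech"],
    }
    return table.get(unit, [])


menu = correct_with_scope("beach", toy_scope_model, toy_alternatives)
```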

Conclusion

[0125] Other alternative embodiments will become
apparent to those skilled in the art to which an exemplary
embodiment pertains without departing from its
spirit and scope. Accordingly, the scope of the present
invention is defined by the appended claims rather than
the foregoing description.

Claims 

1. A computer-implemented method for correcting
text, comprising the steps of:

receiving a text selection comprising a plurality 
of text components derived from different input 
sources; 

at least one of the text components comprising 
a stochastic text component derived from a sto- 
chastic input source or a series of stochastic 
input sources; 

receiving a command to display alternatives for 
the text selection; 

parsing the text selection into the text compo- 
nents; 

retrieving the stochastic model for the stochas- 
tic text component from its associated stochas- 
tic input source or series of input sources; 
combining the stochastic model with other text 
components to produce a list of alternatives for 
the text selection; and 

displaying the list of alternatives for the text 
selection on a display device. 

2. The method of claim 1, further comprising the steps
of:

receiving a user command selecting one of the
displayed alternatives; and
replacing the text selection with the selected
alternative.

3. The method of claim 1, further comprising the steps
of:

receiving an edit to the text selection; 
producing a revised list of alternatives for the 
edited text selection; and 
displaying the revised list of alternatives for the 
edited text selection. 

4. The method of claim 1, further comprising the steps
of:

receiving an edit to one of the stochastic text 
components; 

retrieving a revised stochastic model for the 
edited stochastic text component from its asso- 
ciated stochastic input source or series of input 
sources; 

combining the revised stochastic model with
another stochastic model associated with the
text selection to produce a revised list of
alternatives for the edited text selection; and
displaying the revised list of alternatives for the
edited text selection.


5. The method of claim 1, wherein the text selection 
comprises a portion of text in a file within an appli- 
cation selected from the group consisting of a word 
processor, a spreadsheet, a browser, an electronic 
mail program, a music transcription program, a
CAD program, a presentation program, and an 
operating system. 

6. The method of claim 1, wherein the step of displaying
the alternatives for the text selection further
comprises the steps of:

ranking the alternatives for the text selection in
probability order; and

displaying the alternatives in their rank order on
the display device.

7. The method of claim 6, wherein the step of displaying
the alternatives in their rank order further
comprises the steps of:

selecting a pre-determined number of highest
ranked alternatives; and
displaying the selected alternatives in their
rank order on the display device.

8. The method of claim 1, wherein the text selection
comprises a plurality of stochastic text components
and the step of combining the stochastic models
further comprises the steps of:

combining the stochastic models for each
stochastic text component to produce an interim
list of alternatives for the text selection;
providing the interim list of alternatives to a
natural language model;

receiving a revised list of alternatives for the
text selection from the natural language model,
the revised list of alternatives comprising a
reevaluation of the interim list of alternatives
based on natural language principles applied
by the natural language model to the text
selection as a whole; and

displaying the revised list of alternatives as the
list of alternatives for the text selection.

9. The method of claim 8, wherein the revised list of
alternatives also comprises additional alternatives
formed by the natural language model that are not
found in the interim list of alternatives provided to
the natural language model.

10. The method of claim 8, further comprising the step
of providing the stochastic model retrieved for one
or more stochastic text components to a natural
language model for reevaluation based on natural
language principles.

11. The method of claim 8, further comprising the step 
of providing the stochastic model for each stochas- 
tic text component to the natural language model 
for use in creating the revised list of alternatives. 

12. The method of claim 1, wherein the text selection 
comprises a plurality of stochastic text components 
and the stochastic models for the text components 
comprise lattices, and wherein the step of combin- 
ing the stochastic models to produce a list of alter- 
natives for the text selection further comprises the 
steps of: 

concatenating the lattices into a metalattice
that includes information about any text
components that are derived from a non-stochastic
source; and

producing the list of alternatives for the text 
selection from the metalattice. 

13. The method of claim 1, wherein the text selection
comprises a plurality of stochastic text components
and one of the stochastic models comprises an
"n-best" candidate list and another stochastic model
comprises a lattice, and wherein the step of
combining the stochastic models to produce a list of
alternatives for the text selection further comprises
the steps of:

creating an "n-best" candidate list corresponding
to the lattice; and

producing the list of alternatives for the text
selection by combining the "n-best" candidate
lists for the text components.

14. The method of claim 1, wherein the step of retriev- 
ing a stochastic model for a text component origi- 
nating from a stochastic input source further 
comprises the steps: 

determining if the text component is derived
from stochastic input sources configured in
series;

if the text component is derived from stochastic 
input sources configured in series, deriving a 
series stochastic model by combining together 
a stochastic model from each stochastic input 
source in the series; 

retrieving the series stochastic model as the 
stochastic model for the text component. 

15. The method of claim 1, wherein the step of retrieving
the stochastic model for the stochastic text
component from its associated stochastic input source
or series of input sources comprises the steps of:

receiving user input into a first stochastic input
source in a series of stochastic input sources;
selecting a stochastic result comprising a
plurality of alternatives produced by the first
stochastic input source;

producing a plurality of stochastic results for a 
second stochastic input source in the series by 
using each alternative of the stochastic result 
produced by the first stochastic input source as 
input into the second stochastic input source to 
produce a stochastic result for the second sto- 
chastic input source; 

if any stochastic result for the second stochas- 
tic input source does not comprise an "n-best" 
alternatives list, converting that stochastic 
result to an "n-best" alternatives list; 
combining the stochastic results for the second 
stochastic input source to create a totalized 
alternatives list for the second stochastic input 
source. 

16. The method of claim 15, wherein the step of com- 
bining the plurality of stochastic results for the sec- 
ond stochastic input source to create the totalized 
alternatives list for the second stochastic input 
source further comprises the steps of: 

creating a single entry in the totalized
alternatives list for each unique alternative appearing
in the plurality of stochastic results for the
second stochastic input source;
calculating a probability for each alternative in
the totalized alternatives list by summing all
probabilities assigned to that alternative in the
plurality of stochastic results for the second
stochastic input source; and
assigning each calculated probability to its 
associated alternative. 

17. A computer-readable medium having
computer-executable instructions for performing the method
of claim 1.

18. A computer adapted to perform the method of claim



19. A computer-readable medium having
computer-executable instructions for performing the method
of claim 16.

20. A computer adapted to perform the method of claim 
16. 

21. A computer-implemented method for correcting
text, comprising the steps of:

receiving a text selection from a user;
receiving a command to display alternatives for
the text selection;

submitting the text selection to a correction
scope model to determine if a scope of correction
should be adjusted;
receiving from the correction scope model a
text unit that includes the text selection and at
least one adjacent word;
producing a list of alternatives for the text unit;
and

displaying the list of alternatives for the text unit
on a display device.

22. The method of claim 21, further comprising the
steps of:

receiving a user command selecting one of the
displayed alternatives; and
replacing the text unit with the selected
alternative.

23. The method of claim 21, wherein the adjacent word
is incorrect because of a related error that caused a
word within the text selection to be incorrect.

24. The method of claim 21, wherein the correction
scope model includes criteria selected from the
group consisting of a natural language model, a
model of likely errors, an acoustic model, a
handwriting model, and a vision-based model.

25. The method of claim 21, wherein the step of
producing a list of alternatives for the text unit further
comprises the steps of:

parsing the text unit into text components
derived from different input sources;
determining if one of the text components
comprises a stochastic text component;

if the one of the text components comprises the
stochastic text component, retrieving a
stochastic model for the stochastic text
component; and

combining the stochastic model with other text
components to produce a list of alternatives for
the text unit.

26. A computer-readable medium having
computer-executable instructions for performing the method
of claim 21.

27. A computer adapted to perform the method of claim
25.


28. A computer for correcting text, comprising: 

means for receiving a text selection comprising
a plurality of text components derived from
different input sources;

at least one stochastic input source from which is
derived a stochastic text component, at least one of
the text components derived from different input
sources comprising the stochastic text component;

means for receiving a command to display
alternatives for the text selection;

parsing means for parsing the text selection
into the text components;

retrieving means for retrieving the stochastic model
for the stochastic text component from its associated
at least one stochastic input source;

means for combining the stochastic model with
other text components to produce a list of
alternatives for the text selection;

display means for displaying the list of
alternatives for the text selection.